SEQUENCE LISTING 



<110> XIANG, JINHUA 

WUNSCHMANN, SABINA 
SCHMIDT, WARREN 
STAPLETON, JACK T. 

<120> FULL-LENGTH GB VIRUS C (HEPATITIS G VIRUS) RNA TRANSCRIPTS ARE INFECTIOUS 
IN PRIMARY CD4 POSITIVE T CELLS 

<130> IOWA:030US 

<14 0> UNKNOWN 
<141> 2001-04-05 

<140> 60/253,390 
<141> 2000-11-27 

<150> 60/195,597 
<151> 2000-04-06 

<160> 2 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 9395 
<212> DNA 

<213> Hepatitis G virus 
<400> 1 

tgacgtgggg gggttgatcc cccccccccg gcactgggtg caagccccat aaaccgacgc 60 
ctatctaagt agacgcaatg actcggcgcc gactcggcga ccggccaaaa ggtggtggat 120 
gggtggtgac agggttggta ggtcgtaaat cccggtcatc ctggtagcca ctataggtgg 180 
gtcttaagag aaggtcaaga ctcctcttgt gcctgcggcg agaccgcgca cggtccacag 240 
gtgctggccc taccggtgtg aataagggcc cgacgtcagg ctcgtcgtta aaccgagccc 300 
gtcacccacc tgggcaaacg acgcccacgt acggtccacg tcgcccttca atgtctctct 360 
tgaccaatag gtttatccgg cgagttgaca aggaccagtg ggggccgggg gttatgggga 420 
aggaccccaa accctgccct tcccggtggg ccgggaaatg catggggcca cccagctccg 4 80 
cggcggcctg cagccggggt agcccaagaa tccttcgggt gagggcgggt ggcatttctc 540 
ttttctatac catcatggca gtccttctgc tccttctcgt ggttgaggcc ggggccattc 600 
tggccccggc cacccacgct tgtcgagcga atgggcaata tttcctcaca aattgctgtg 660 
ccccggaaga catcgggttc tgcctggaag gcggatgcct ggtggccctg gggtgcacgg 720 
tttgcaccga ccgttgctgg ccactgtatc aggcgggttt ggctgtgcgg cctggcaagt 780 
ccgcggccca gctcgttggg gaactgggga gcctgtacgg gcccttgtcg gtctcggctt 840 
acgtagccgg gatcctgggt ctgggcgagg tttactccgg ggtcctgaca gttggtgttg 900 
cgttgaggcg ccgggtctac ctgatgccca acctgaagtg tgcagtagaa tgtgacgtta 960 
agtggggaag tgagttttgg agatggactg agcagttggc ctccaattac tggattttgg 1020 
aatacctttg gaaagtccca tttgaatttt ggagaggagt gatgagcctg acccctctgt 1080 
tggtttgggt ggccgcattg cttttgctgg agcaacggat tgtcatggtt ttcctgctgg 1140 
tgacgatggc ggggatgttg caaggcgccc ccgcctccgt tttggggtcc cgcccctttg 1200 
actacgggtt gaagtggcag tcatgctcct gcagggctaa cgggtcgcgt attcccactg 1260 
gggagagggt gtgggatcga gggaatgtca cgctcttgtg tgactgcccc aacggcccct 1320 
gggtttgggt cccggccttt tgccaggcgg ttgggtgggg cgaccccatc acccattgga 138 0 
gccacggaca aaaccagtgg cccctatcat gcccccaata tgtctatggg tctgtgtccg 1440 
taacgtgcgt gtggggttcc gtgtcttggt ttgcctcgac cggcggtcgt gattcgaaga 1500 
tcgatgtgtg gagtttggtg ccggttggat ctgccagctg caccatagcc gctctagggt 1560 
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catcggatcg cgacacggtg gttgagctct 
ttctggaccg tcggcctgct tcatgtggca 
ggtcggttag attccctttc catcggtgcg 
aagctgtgcc cttcgtcaac aggacaactc 
aggggagagg caacccggtg cggtcgcccc 
tccgggattc cctgcatttg gtgaaatgtc 
cgttcgggtt cttccccgga gtcccgccca 
tgtctgaggc attgggcgga gctgggctta 
ggtgttcgga gctgatggga cgccgaaatc 
ctggtagacc tgacgggttc atacacgtcc 
acttcatccc tcctccacgc tggttgctct 
tgaagctggc tgaggcacgg ttggtcccgt 
accagttggc ggttctagga ctgccggctg 
cgggccctgc cttgtcatgg tgtttgggcc 
caaacctggt gttgtacttt cggtggatgg 
ggaagctcgc tcggggagct ttcccgctgg 
ggcgcacctc tgtgctcggg gccgagttct 
cggtgttggg ctgggtggtg gccagcgtgg 
tgagcgcagg ggggtggaag cacaaggccg 
aggctgtgcg ccagagggtg gtgcggagcc 
tgacgttcgc ctggtgcttg gcctcataca 
tggccttggt cctcctcttc ggcctgttcg 
tggtctcccg gccctcgtta cggcgactgg 
gcgagaaggc caccaccatc cgactggtct 
ttgaccacat gggctctttc tcgcgcgctg 
ctttggagcc cttgtcattc actaggacgg 
ccctgtcctg cggacagtgc gtcatgggtt 
ttctcatcgg cgtctttcag gatgtgaatc 
cagttgtcat ccgtcggtgc ggaaagggct 
gtagggatcc tgacttacat ccagggaacg 
gcatgggcac atgtctgaat ggcctgctgt 
ccatcgccac gcccgtgggg gcccttaatc 
cggtgtaccc gcttccagat ggggcaactt 
cctgttgggt tattagatcc gacggggctt 
ttgagctgga tgtggccatg gaggtctctg 
tttgcgacaa agggcacgca gtaagaatgc 
ttactgcggc gcgattcact aggccgtgga 
cagaaccccc tccggtgccg gcaaaaggag 
c gggggcggg aaagagcacc cgcgtaccgt 
tgatcttgaa cccgtcggta gctaccgtga 
cggggaaaca ccccagtatt tactgtggcc 
actcgcccct tacgtattcc acttacggaa 
ggggtgtgtc ggtggtcatt tgtgacgagt 
gcattgggcg tgtcagggag ctggcgcgag 
ctgccacccc tcccggatcc ccgatgaccc 
acgtgggaga gatccccttc tatgggcatg 
ggcatctcgt attctgccac tccaaggctg 
ctaggggggt aaatgccatc gcctattaca 
gagacctggt ggtgtgtgct acagacgcac 
ctgtcaccga ttgtgggtta gtggtggagg 
ttaccatctc cctgcgcacg gtgcccgcgt 
gcacgggtag gggcaggtct gggcgctact 
gtgtggtgcg ctcaggtcct gtctggtcgg 
tggaacctga cctgacagca aacctactga 
ccgtcgcagc tgacattggg gaagccgcgg 
tgcatcccga tgttagctgg gcaaaagttc 
ttcagcggac catgtgccgg gaaacactgt 



ccgagtgggg agtcccgtgc gtaacgtgta 1620 
cctgtgtgcg ggactgctgg cccgaaaccg 1680 
gcacggggcc tcggctgaca aaggacttgg 1740 
ccttcaccat aaggggcccc ctgggcaacc 1800 
tgggttttgg gtcctacacc atgaccaaga 1860 
ccacaccagc catagagcct ccgactggaa 1920 
ttaacaactg catgccgcta ggcacggaag 1980 
cgggggggtt ctacgagcct ctggttcgca 2 040 
cggtttgccc ggggtacgca tggctgtcct 2100 
aggggcacct gcaggaggtg gatgcgggca 2160 
tggattttgt atttgtcctg ctctatctga 2220 
tgatcttgct tctgctgtgg tggtgggtga 2280 
tggacgctgc cgtggcgggt gaagtttttg 2340 
ttcccactgt cagtatgata ctaggtctag 2400 
gccctcagcg cctcatgttc ctcgtgttgt 24 60 
cacttttgat ggggatttcg gcgacccgcg 2520 
gcttcgatgt cacattcgag gtggacactt 2580 
tggcttgggc catagcgctc ctgagctcaa 2640 
tgatctatag gacgtggtgt aaagggtacc 2700 
ccctcgggga ggggcgtcct accaagcttc 2760 
tctggccgga tgctgtgatg atggtggtgg 2820 
acgcactgga ctgggccctg gaggagctcc 2880 
cacgggtggt tgagtgctgt gtgatggcgg 2940 
ccaagatgtg cgcaagaggg gcctacctgt 3000 
tcaaggagcg cttgttggaa tgggacgcgg 3060 
actgtcgcat catcagagat gccgcgagga 3120 
tacccgtggt agcacggcgc ggtgatgagg 3180 
atttgcctcc cgggtttgtc ccgactgcac 324 0 
tcctgggggt cacgaaggca gccttgacag 3300 
tcatggtgtt ggggacggct acgtcacgaa 3360 
tcacaacttt ccatggggct tcatcccgaa 3420 
ccaggtggtg gtcagccagt gatgacgtca 34 80 
cgttgacgcc ctgcacttgc caggcggagt 3540 
tgtgccatgg cttgagcaag ggggacaagg 3600 
acttccgtgg ttcgtctggt tcaccggtcc 3660 
tcgtgtcagt gctccactct ggcggcaggg 3720 
ctcaagtacc aacagatgcc aagactacca 3780 
ttttcaagga ggccccgttg tttatgccta 3840 
tggagtacgg caacatgggc cacaaggtct 3900 
gggccatggg cccatacatg gagcggctgg 3960 
atgacaccac tgctttcaca aggatcactg 4020 
ggtttttggc caaccctagg cagatgctga 4080 
gccacagtca tgactcaact gtgttgttgg 4140 
gatgtggagt gcaattggtg ctctacgcca 4200 
agcacccatc aatcattgag acaaaactgg 4260 
gcatacctct tgagcggatg cggaccggaa 4320 
agtgcgagcg cctggcgggc cagttttcgg 4380 
gggggaaaga cagttctatc atcaaagatg 4440 
tatccactgg gtacactggg aacttcgatt 4500 
aggtcgtcga ggtgaccctt gatcccacca 4560 
cggctgaact gtcgatgcag cggcgaggac 4620 
actacgcggg ggtcggcaag gcccctgctg 4680 
cggtggaagc cggtgtgacc tggtacggaa 4740 
gactttacga caactgccct tacaccgcag 4800 
tgttcttttc ggggcttgcc ccgttgagga 4 860 
gcggcgtcaa ctggcccttc ctggtgggtg 4920 
ctcccggccc atcggatgac ccccagtggg 4980 
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caggtctgaa gggcccgaat cctgtcccac tcctgctgag gtggggcaat gatttaccat 5040 
ctaaagtggc cggccatcac atcgtggacg acctggtccg taggctcggg gtggcggagg 5100 
gttacgtccg ctgcgatgcg ggacccatct tgatggtggg cctcgctatt gcggggggca 5160 
tgatctatgc gtcatacacc gggtctctcg tggtggttac agactgggat gtgaaggggg 5220 
gtggcagccc cctttatcgg catggagacc aggccacgcc ccagccggtt gtgcaggtcc 5280 
ccccggtaga ccatcggccg gggggagagt ctgcgccatc ggatgccaac acagtgacag 5340 
atgcggtggc ggccatccag gtggattgcg attggtcagt catgaccctg tcgatcgggg 5400 
aagtgctgtc cttggcccag gctaagacgg ccgaggccta cgcagctacc accaagtggc 54 60 
ttgctggctg ctacacgggg acgcgggccg tccccactgt ttcaattgtt gacaagctct 5520 
tcgccggggg ctgggcggcg gtggtaggcc attgccacag tgtaatagct gcggcagtgg 5580 
cggcctatgg ggcttctagg agccctccat tggctgctgc cgcttcctac ctcatggggt 5640 
tgggcgtcgg aggcaacgcg caaacccgct tagcctccgc tctcctacta ggggccgctg 5700 
ggaccgctct gggcacgcct gtcgtggggt taaccatggc gggcgcgttc atgggaagtg 5760 
ctagcgtctc cccctccttg gtcaccattt tactgggggc cgtggggggc tgggagggcg 5820 
tggtgaatgc ggctagcctt gtcttcgact ttatggcggg gaaactatca tcagaagatc 5880 
tgtggtatgc catcccagtg ctaaccagtc cgggggcagg acttgcgggg atcgccctcg 5940 
ggttggtgtt gtactcagct aacaactctg gcactaccac ttggttgaac cgtctgctga 6000 
ctacattgcc aaggtcctca tgcatccctg acagttactt tcagcaggcc gattactgtg 6060 
acaaggtctc agctgtgctc cgacgcttga gcctcactcg caccgtggtt gccctggtca 6120 
acagggagcc taaggtggat gaggttcagg tggggtacgt ctgggacttg tgggagtgga 6180 
tcatgcgtca agtgcgcatg gtgatggcca gacttcgggc cctctgcccc gtggtgtcat 6240 
tacccttatg gcactgcggg gaggggtggt ccggagaatg gttgttggac ggccatgttg 6300 
agagtcgttg tctttgtggt tgcgtgatca ccggtgatgt tttgaatggg caactcaaag 6360 
atccagttta ctctaccaag ctgtgcaggc attattggat ggggacagtc cctgtgaaca 6420 
tgctgggcta tggcgagacg tcgcctttgc tcgcctcaga caccccgaag gtggtaccat 6480 
tcgggacgtc tgggtgggct gaggtggtgg tgacccctac ccacgttgtg atcaggcgaa 6540 
catccgccta caaactgctg cgccagcaaa tcctgtcggc tgctgttgct gagccctatt 6600 
acgtcgacgg cataccggtc tcatgggacg cggacgcgcg agcgcctgcc atggtctatg 6660 
gccctgggca aagtgtcacc attgacgggg aacgctacac ccttccgcat caactgcggc 6720 
ttaggaatgt ggcgccctct gaggtgtcat ccgaggtgtc cattgacatt gggacggaga 6780 
ctgaagactc agaactgact gaggccgacc tgccgccggc ggctgcagcc cttcaggcta 6840 
tcgagaatgc tgcgagaatt cttgaacctc acatagatgt catcatggaa gattgcagta 6900 
caccctctct ttgtgggagt agccgagaga tgcctgtgtg gggagaagac ataccccgca 6960 
ctccatcgcc agcacttatc tcggttactg agagcagccc agatgagaag accccgtcgg 7020 
tgtcttcctc gcaggaggat accccgtctt ctgactcatt cgaggtcatc caagagtccg 7080 
agacagccga aggggaggaa agcgtcttca acgtggctct ttccgtacta aaagccttgt 7140 
ttccacagag cgatgccaca agaaagctta ccgttaagat gtcatgctgt gttgagaaga 7200 
gcgtaacacg cttcttttca ttgggattga cggtcgctga cgtggcaagc ctgtgtgaga 7260 
tggaaatcca gaaccataca gcctattgtg acaaggtgcg cactccgctt gaattgcagg 7320 
ttgggtgctt ggtgggcaat gaacttacct ttgaatgtga caagtgtgag gctaggcaag 7380 
agaccttggc ttccttctct tacatttggt ctggggtgcc actgacgagg gccactccgg 7440 
ccaagccccc tgtggtgagg ccggttggct ccttgctggt ggccgacacc accaaggtgt 7500 
atgtcaccaa cccggacaat gttgggagaa gagttgacaa ggttaccttc tggcgtgccc 7560 
ctagggttca tgacaaattc ctcgtggact ccatagagcg cgctaagagg gcagctcaag 7620 
cctgcctaag catgggttac acttatgagg aggcaataag gactgtaagg ccacatgctg 7680 
ccatgggctg gggatctaag gtgtcggtca aggacctcgc cacccctgcg gggaagatgg 774 0 
ctgtccatga ccggctccag gagatacttg aagggacgcc agtccccttt actcttactg 7800 
tgaaaaagga agtgttcttc aaagaccgaa aggaagagaa ggccccccgc ctcattgtgt 7860 
tcccccccct ggacttccgg atagctgaaa agcttattct gggagaccct ggacgggtag 7920 
ccaaggcggt gttggggggg gcctacgcct tccagtacac cccaaatcag cgaattaggg 7980 
agatgctcaa actgtgggaa tcaaagaaga caccatgcgc catctgtgtg gacgccacat 8040 
gcttcgacag tagcataact gaagaggacg tggcgctgga gacagagctt tatgccctgg 8100 
cttcagacca tccagaatgg gtgcgtgccc tggggaaata ctatgcctct ggcacaatgg 8160 
taacccccga gggggtgcca gtgggtgaga ggtattgtag atcctcaggg gtcttgacca 8220 
ccagtgcgag caactgcttg acttgctata tcaaggtgaa agccgcctgt gagagggtgg 8280 
ggctgaaaaa tgtctcgctc ctcatcgctg gcgatgactg tttgatcata tgcgaacggc 8340 
ctgtgtgcga tcctagcgac gctttgggca gagccctggc gagctacggg tacgcatgcg 8400 
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agccttcgta tcatgcatca ctggacacgg cccccttctg ctccacttgg ctagctgagt 8460 
gcaatgcaga tgggaaacgc catttcttcc tgaccacgga ctttcggagg cccctcgctc 8520 
gcatgtcgag cgagtacagt gacccaatgg cttcggccat cggttacatc ctcctatacc 8580 
cttggcatcc tatcacacgg tgggtcatca tccctcacgt gctcacctgc gcgtttaggg 8640 
gtggtggcac accgtctgat cctgtgtggt gccaggtaca tggtaattac tacaagtttc 87 00 
cactggacaa actgcctaac atcatcgtgg ccctccacgg accagcagcg ttgagggtta 8760 
ccgcagacac aactaagaca aaaatggagg ctggcaaggt gctgagcgac ctcaagctcc 8820 
ctggcctagc agtccaccgg aagaaggccg gggcattgcg aacgcgtatg ctccggtcgc 8880 
gcggttgggc tgagttggct agggggctgt tgtggcgtcc aggcctgcgg cttccccctc 8940 
cggagattgc tggtatcccc gggggtttcc ccctttcccc cccctatatg ggggtggttc 9000 
atcaattgga tttcacaagc cagaggagtc gctggcggtg gttggggttc ttagccctgc 9060 
tcatcgtagc cctcttcggg tgaactaaat tcatctgttg cggcaaggtc cggtgactga 9120 
tcatcactgg aggaggttcc cgccctcccc gccccagggg tctccccgct gggtaaaaag 9180 
ggcccggcct tgggaggcat ggtggttact aaccccctgg cagggtcaaa gcctgatggt 9240 
gctaatgcac tgccacttcg gtggcgggtc gctaccttat agcgtaatcc gtgactacgg 9300 
gctgctcgca gagccctccc cggatggggc acagtgcact gtgatctgaa ggggtgcacc 9360 
ccggtaagag ctcggcccaa aggccgggtt ctact 9395 



<210> 2 
<211> 2910 
<212> PRT 

<213> Hepatitis G virus 
<400> 2 

Met Ser Leu Leu Thr Asn Arg Phe He Arg Arg Val Asp Lys Asp Gin 
15 10 15 

Trp Gly Pro Gly Val Met Gly Lys Asp Pro Lys Pro Cys Pro Ser Arg 
20 25 30 

Trp Ala Gly Lys Cys Met Gly Pro Pro Ser Ser Ala Ala Ala Cys Ser 
35 40 45 

Arg Gly Ser Pro Arg He Leu Arg Val Arg Ala Gly Gly He Ser Leu 
50 55 60 

Phe Tyr Thr He Met Ala Val Leu Leu Leu Leu Leu Val Val Glu Ala 
65 70 75 80 

Gly Ala He Leu Ala Pro Ala Thr His Ala Cys Arg Ala Asn Gly Gin 
85 90 95 

Tyr Phe Leu Thr Asn Cys Cys Ala Pro Glu Asp He Gly Phe Cys Leu 
100 105 110 

Glu Gly Gly Cys Leu Val Ala Leu Gly Cys Thr Val Cys Thr Asp Arg 
115 120 125 

Cys Trp Pro Leu Tyr Gin Ala Gly Leu Ala Val Arg Pro Gly Lys Ser 
130 135 140 

Ala Ala Gin Leu Val Gly Glu Leu Gly Ser Leu Tyr Gly Pro Leu Ser 
145 150 155 160 

Val Ser Ala Tyr Val Ala Gly He Leu Gly Leu Gly Glu Val Tyr Ser 
165 170 175 
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Gly Val Leu Thr Val Gly Val Ala Leu Arg Arg Arg Val Tyr Leu Met 

180 185 190 

Pro Asn Leu Lys Cys Ala Val Glu Cys Asp Val Lys Trp Gly Ser Glu 
195 200 205 

Phe Trp Arg Trp Thr Glu Gin Leu Ala Ser Asn Tyr Trp He Leu Glu 
210 215 220 



Tyr Leu Trp Lys Val Pro Phe Glu Phe Trp Arg Gly Val Met Ser Leu 
225 230 235 240 

Thr Pro Leu Leu Val Trp Val Ala Ala Leu Leu Leu Leu Glu Gin Arg 
245 250 255 

He Val Met Val Phe Leu Leu Val Thr Met Ala Gly Met Leu Gin Gly 
260 265 270 

Ala Pro Ala Ser Val Leu Gly Ser Arg Pro Phe Asp Tyr Gly Leu Lys 
3 275 280 285 

3 Trp Gin Ser Cys Ser Cys Arg Ala Asn Gly Ser Arg He Pro Thr Gly 
290 295 300 

~ ; Glu Arg Val Trp Asp Arg Gly Asn Val Thr Leu Leu Cys Asp Cys Pro 
Z 305 310 315 320 

* : Asn Gly Pro Trp Val Trp Val Pro Ala Phe Cys Gin Ala Val Gly Trp 
325 330 335 

5 Gly Asp Pro He Thr His Trp Ser His Gly Gin Asn Gin Trp Pro Leu 
3 340 345 350 

*jj Ser Cys Pro Gin Tyr Val Tyr Gly Ser Val Ser Val Thr Cys Val Trp 
2 355 360 365 

Gly Ser Val Ser Trp Phe Ala Ser Thr Gly Gly Arg Asp Ser Lys He 
370 375 380 

Asp Val Trp Ser Leu Val Pro Val Gly Ser Ala Ser Cys Thr He Ala 
385 390 395 400 

Ala Leu Gly Ser Ser Asp Arg Asp Thr Val Val Glu Leu Ser Glu Trp 
405 410 415 

Gly Val Pro Cys Val Thr Cys He Leu Asp Arg Arg Pro Ala Ser Cys 
420 425 430 

Gly Thr Cys Val Arg Asp Cys Trp Pro Glu Thr Gly Ser Val Arg Phe 
435 440 445 

Pro Phe His Arg Cys Gly Thr Gly Pro Arg Leu Thr Lys Asp Leu Glu 
450 455 460 

Ala Val Pro Phe Val Asn Arg Thr Thr Pro Phe Thr He Arg Gly Pro 
465 470 475 480 
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Leu Gly Asn Gin Gly Arg Gly Asn Pro Val Arg Ser Pro Leu Gly Phe 
485 490 495 

Gly Ser Tyr Thr Met Thr Lys lie Arg Asp Ser Leu His Leu Val Lys 
500 505 510 

Cys Pro Thr Pro Ala lie Glu Pro Pro Thr Gly Thr Phe Gly Phe Phe 

515 520 525 

Pro Gly Val Pro Pro lie Asn Asn Cys Met Pro Leu Gly Thr Glu Val 
530 535 540 

Ser Glu Ala Leu Gly Gly Ala Gly Leu Thr Gly Gly Phe Tyr Glu Pro 
545 550 555 560 

Leu Val Arg Arg Cys Ser Glu Leu Met Gly Arg Arg Asn Pro Val Cys 
565 570 575 



Pro Gly Tyr Ala Trp Leu Ser Ser Gly Arg Pro Asp Gly Phe lie His 

580 585 590 

Val Gin Gly His Leu Gin Glu Val Asp Ala Gly Asn Phe He Pro Pro 

595 600 605 



fe J Pro Arg Trp Leu Leu Leu Asp Phe Val Phe Val Leu Leu Tyr Leu Met 

^ 610 615 620 

i jj 

£0 Lys Leu Ala Glu Ala Arg Leu Val Pro Leu He Leu Leu Leu Leu Trp 
s 625 630 635 640 

o 

JZ Trp Trp Val Asn Gin Leu Ala Val Leu Gly Leu Pro Ala Val Asp Ala 
O 645 650 655 

'l2 Ala Val Ala Gly Glu Val Phe Ala Gly Pro Ala Leu Ser Trp Cys Leu 
^ 660 665 670 

Gly Leu Pro Thr Val Ser Met He Leu Gly Leu Ala Asn Leu Val Leu 
675 680 685 

Tyr Phe Arg Trp Met Gly Pro Gin Arg Leu Met Phe Leu Val Leu Trp 
690 695 700 

Lys Leu Ala Arg Gly Ala Phe Pro Leu Ala Leu Leu Met Gly He Ser 
705 710 715 720 

Ala Thr Arg Gly Arg Thr Ser Val Leu Gly Ala Glu Phe Cys Phe Asp 
725 730 735 

Val Thr Phe Glu Val Asp Thr Ser Val Leu Gly Trp Val Val Ala Ser 
740 745 750 

Val Val Ala Trp Ala He Ala Leu Leu Ser Ser Met Ser Ala Gly Gly 
755 760 765 



Trp Lys His Lys Ala Val He Tyr Arg Thr Trp Cys Lys Gly Tyr Gin 
770 775 780 



Ala Val Arg Gin Arg Val Val Arg Ser Pro Leu Gly Glu Gly Arg Pro 
785 790 795 800 



Thr Lys Leu Leu Thr Phe Ala Trp Cys Leu Ala Ser Tyr lie Trp Pro 
805 810 815 

Asp Ala Val Met Met Val Val Val Ala Leu Val Leu Leu Phe Gly Leu 
820 825 830 

Phe Asp Ala Leu Asp Trp Ala Leu Glu Glu Leu Leu Val Ser Arg Pro 
835 840 845 

Ser Leu Arg Arg Leu Ala Arg Val Val Glu Cys Cys Val Met Ala Gly 
850 855 860 

Glu Lys Ala Thr Thr lie Arg Leu Val Ser Lys Met Cys Ala Arg Gly 
865 870 875 880 

Ala Tyr Leu Phe Asp His Met Gly Ser Phe Ser Arg Ala Val Lys Glu 
g 885 890 895 

n; Arg Leu Leu Glu Trp Asp Ala Ala Leu Glu Pro Leu Ser Phe Thr Arg 
* 900 905 910 

2 Thr Asp Cys Arg lie lie Arg Asp Ala Ala Arg Thr Leu Ser Cys Gly 
2 915 920 925 

Q Gin Cys Val Met Gly Leu Pro Val Val Ala Arg Arg Gly Asp Glu Val 
930 935 940 

f Leu lie Gly Val Phe Gin Asp Val Asn His Leu Pro Pro Gly Phe Val 
-J 945 950 955 960 

2 Pro Thr Ala Pro Val Val lie Arg Arg Cys Gly Lys Gly Phe Leu Gly 
d 965 970 975 

Val Thr Lys Ala Ala Leu Thr Gly Arg Asp Pro Asp Leu His Pro Gly 
980 985 990 

Asn Val Met Val Leu Gly Thr Ala Thr Ser Arg Ser Met Gly Thr Cys 
995 1000 1005 

Leu Asn Gly Leu Leu Phe Thr Thr Phe His Gly Ala Ser Ser Arg Thr 
1010 1015 1020 

lie Ala Thr Pro Val Gly Ala Leu Asn Pro Arg Trp Trp Ser Ala Ser 
1025 1030 1035 1040 

Asp Asp Val Thr Val Tyr Pro Leu Pro Asp Gly Ala Thr Ser Leu Thr 
1045 1050 1055 

Pro Cys Thr Cys Gin Ala Glu Ser Cys Trp Val lie Arg Ser Asp Gly 
1060 1065 1070 

Ala Leu Cys His Gly Leu Ser Lys Gly Asp Lys Val Glu Leu Asp Val 
1075 1080 1085 
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Ala Met Glu Val Ser Asp Phe Arg Gly Ser Ser Gly Ser Pro Val Leu 
1090 1095 1100 



Cys Asp Lys Gly His Ala Val Arg Met Leu Val Ser Val Leu His Ser 
1105 1110 1115 1120 

Gly Gly Arg Val Thr Ala Ala Arg Phe Thr Arg Pro Trp Thr Gin Val 
1125 1130 1135 

Pro Thr Asp Ala Lys Thr Thr Thr Glu Pro Pro Pro Val Pro Ala Lys 
1140 1145 1150 

Gly Val Phe Lys Glu Ma Pro Leu Phe Met Pro Thr Gly Ala Gly Lys 
1155 1160 1165 

Ser Thr Arg Val Pro Leu Glu Tyr Gly Asn Met Gly His Lys Val Leu 
1170 1175 1180 

lie Leu Asn Pro Ser Val Ala Thr Val Arg Ala Met Gly Pro Tyr Met 
1185 1190 1195 1200 

Glu Arg Leu Ala Gly Lys His Pro Ser lie Tyr Cys Gly His Asp Thr 
1205 1210 1215 

Thr Ala Phe Thr Arg lie Thr Asp Ser Pro Leu Thr Tyr Ser Thr Tyr 
1220 1225 1230 

Gly Arg Phe Leu Ala Asn Pro Arg Gin Met Leu Arg Gly Val Ser Val 
1235 1240 1245 

Val lie Cys Asp Glu Cys His Ser His Asp Ser Thr Val Leu Leu Gly 
1250 1255 1260 

lie Gly Arg Val Arg Glu Leu Ala Arg Gly Cys Gly Val Gin Leu Val 
1265 1270 1275 1280 

Leu Tyr Ala Thr Ala Thr Pro Pro Gly Ser Pro Met Thr Gin His Pro 
1285 1290 1295 

Ser lie lie Glu Thr Lys Leu Asp Val Gly Glu lie Pro Phe Tyr Gly 
1300 1305 1310 

His Gly lie Pro Leu Glu Arg Met Arg Thr Gly Arg His Leu Val Phe 
1315 1320 1325 

Cys His Ser Lys Ala Glu Cys Glu Arg Leu Ala Gly Gin Phe Ser Ala 
1330 1335 1340 

Arg Gly Val Asn Ala lie Ala Tyr Tyr Arg Gly Lys Asp Ser Ser lie 
1345 1350 1355 1360 

He Lys Asp Gly Asp Leu Val Val Cys Ala Thr Asp Ala Leu Ser Thr 
1365 1370 1375 

Gly Tyr Thr Gly Asn Phe Asp Ser Val Thr Asp Cys Gly Leu Val Val 
1380 1385 1390 
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Glu Glu Val Val Glu Val Thr Leu Asp Pro Thr He Thr He Ser Leu 
1395 1400 1405 



Arg Thr Val Pro Ala Ser Ala Glu Leu Ser Met Gin Arg Arg Gly Arg 
1410 1415 1420 

Thr Gly Arg Gly Arg Ser Gly Arg Tyr Tyr Tyr Ala Gly Val Gly Lys 
1425 1430 1435 1440 

Ala Pro Ala Gly Val Val Arg Ser Gly Pro Val Trp Ser Ala Val Glu 
1445 1450 1455 

Ala Gly Val Thr Trp Tyr Gly Met Glu Pro Asp Leu Thr Ala Asn Leu 
1460 1465 1470 

Leu Arg Leu Tyr Asp Asn Cys Pro Tyr Thr Ala Ala Val Ala Ala Asp 
1475 1480 1485 

He Gly Glu Ala Ala Val Phe Phe Ser Gly Leu Ala Pro Leu Arg Met 
1490 1495 1500 

His Pro Asp Val Ser Trp Ala Lys Val Arg Gly Val Asn Trp Pro Phe 
1505 1510 1515 1520 

Leu Val Gly Val Gin Arg Thr Met Cys Arg Glu Thr Leu Ser Pro Gly 
1525 1530 1535 

Pro Ser Asp Asp Pro Gin Trp Ala Gly Leu Lys Gly Pro Asn Pro Val 
1540 1545 1550 

Pro Leu Leu Leu Arg Trp Gly Asn Asp Leu Pro Ser Lys Val Ala Gly 
1555 1560 1565 

His His He Val Asp Asp Leu Val Arg Arg Leu Gly Val Ala Glu Gly 
1570 1575 1580 

Tyr Val Arg Cys Asp Ala Gly Pro He Leu Met Val Gly Leu Ala He 
1585 1590 1595 1600 

Ala Gly Gly Met He Tyr Ala Ser Tyr Thr Gly Ser Leu Val Val Val 
1605 1610 1615 

Thr Asp Trp Asp Val Lys Gly Gly Gly Ser Pro Leu Tyr Arg His Gly 
1620 1625 1630 

Asp Gin Ala Thr Pro Gin Pro Val Val Gin Val Pro Pro Val Asp His 
1635 1640 1645 

Arg Pro Gly Gly Glu Ser Ala Pro Ser Asp Ala Asn Thr Val Thr Asp 
1650 1655 1660 

Ala Val Ala Ala He Gin Val Asp Cys Asp Trp Ser Val Met Thr Leu 
1665 1670 1675 1680 

Ser He Gly Glu Val Leu Ser Leu Ala Gin Ala Lys Thr Ala Glu Ala 
1685 1690 1695 



9 



Tyr Ala Ala Thr Thr Lys Trp Leu Ala Gly Cys Tyr Thr Gly Thr Arg 
1700 1705 1710 

Ala Val Pro Thr Val Ser lie Val Asp Lys Leu Phe Ala Gly Gly Trp 
1715 1720 1725 

Ala Ala Val Val Gly His Cys His Ser Val He Ala Ala Ala Val Ala 
1730 1735 1740 

Ala Tyr Gly Ala Ser Arg Ser Pro Pro Leu Ala Ala Ala Ala Ser Tyr 
1745 1750 1755 1760 

Leu Met Gly Leu Gly Val Gly Gly Asn Ala Gin Thr Arg Leu Ala Ser 
1765 1770 1775 

Ala Leu Leu Leu Gly Ala Ala Gly Thr Ala Leu Gly Thr Pro Val Val 
1780 1785 1790 

Gly Leu Thr Met Ala Gly Ala Phe Met Gly Ser Ala Ser Val Ser Pro 
1795 1800 1805 

Ser Leu Val Thr He Leu Leu Gly Ala Val Gly Gly Trp Glu Gly Val 
1810 1815 1820 

Val Asn Ala Ala Ser Leu Val Phe Asp Phe Met Ala Gly Lys Leu Ser 
1825 1830 1835 1840 

Ser Glu Asp Leu Trp Tyr Ala He Pro Val Leu Thr Ser Pro Gly Ala 
1845 1850 1855 

Gly Leu Ala Gly He Ala Leu Gly Leu Val Leu Tyr Ser Ala Asn Asn 
1860 1865 1870 

Ser Gly Thr Thr Thr Trp Leu Asn Arg Leu Leu Thr Thr Leu Pro Arg 
1875 1880 1885 

Ser Ser Cys He Pro Asp Ser Tyr Phe Gin Gin Ala Asp Tyr Cys Asp 
1890 1895 1900 

Lys Val Ser Ala Val Leu Arg Arg Leu Ser Leu Thr Arg Thr Val Val 
1905 1910 1915 1920 

Ala Leu Val Asn Arg Glu Pro Lys Val Asp Glu Val Gin Val Gly Tyr 
1925 1930 1935 

Val Trp Asp Leu Trp Glu Trp He Met Arg Gin Val Arg Met Val Met 
1940 1945 1950 

Ala Arg Leu Arg Ala Leu Cys Pro Val Val Ser Leu Pro Leu Trp His 
1955 1960 1965 

Cys Gly Glu Gly Trp Ser Gly Glu Trp Leu Leu Asp Gly His Val Glu 
1970 1975 1980 

Ser Arg Cys Leu Cys Gly Cys Val He Thr Gly Asp Val Leu Asn Gly 
1985 1990 1995 2000 
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Gin Leu Lys Asp Pro Val Tyr Ser Thr Lys Leu Cys Arg His Tyr Trp 
2005 2010 2015 

Met Gly Thr Val Pro Val Asn Met Leu Gly Tyr Gly Glu Thr Ser Pro 
2020 2025 2030 

Leu Leu Ala Ser Asp Thr Pro Lys Val Val Pro Phe Gly Thr Ser Gly 
2035 2040 2045 

Trp Ala Glu Val Val Val Thr Pro Thr His Val Val lie Arg Arg Thr 
2050 2055 2060 

Ser Ala Tyr Lys Leu Leu Arg Gin Gin lie Leu Ser Ala Ala Val Ala 
2065 2070 2075 2080 

Glu Pro Tyr Tyr Val Asp Gly He Pro Val Ser Trp Asp Ala Asp Ala 
2085 2090 2095 

Arg Ala Pro Ala Met Val Tyr Gly Pro Gly Gin Ser Val Thr He Asp 
C3 2100 2105 2110 

1% Gly Glu Arg Tyr Thr Leu Pro His Gin Leu Arg Leu Arg Asn Val Ala 

*H 2115 2120 2125 

l\S 

Pro Ser Glu Val Ser Ser Glu Val Ser He Asp He Gly Thr Glu Thr 
= * 2130 2135 2140 

10 Glu Asp Ser Glu Leu Thr Glu Ala Asp Leu Pro Pro Ala Ala Ala Ala 
e 2145 2150 2155 2160 

~* Leu Gin Ala He Glu Asn Ala Ala Arg He Leu Glu Pro His He Asp 
^ 2165 2170 2175 

^ Val He Met Glu Asp Cys Ser Thr Pro Ser Leu Cys Gly Ser Ser Arg 
2180 2185 2190 

Glu Met Pro Val Trp Gly Glu Asp He Pro Arg Thr Pro Ser Pro Ala 
2195 2200 2205 

Leu He Ser Val Thr Glu Ser Ser Pro Asp Glu Lys Thr Pro Ser Val 
2210 2215 2220 

Ser Ser Ser Gin Glu Asp Thr Pro Ser Ser Asp Ser Phe Glu Val He 
2225 2230 2235 2240 

Gin Glu Ser Glu Thr Ala Glu Gly Glu Glu Ser Val Phe Asn Val Ala 
2245 2250 2255 

Leu Ser Val Leu Lys Ala Leu Phe Pro Gin Ser Asp Ala Thr Arg Lys 
2260 2265 2270 

Leu Thr Val Lys Met Ser Cys Cys Val Glu Lys Ser Val Thr Arg Phe 
2275 2280 2285 

Phe Ser Leu Gly Leu Thr Val Ala Asp Val Ala Ser Leu Cys Glu Met 
2290 2295 2300 
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Glu lie Gin Asn His Thr Ala Tyr Cys Asp Lys Val Arg Thr Pro Leu 
2305 2310 2315 2320 



Glu Leu Gin Val Gly Cys Leu Val Gly Asn Glu Leu Thr Phe Glu Cys 
2325 2330 2335 

Asp Lys Cys Glu Ala Arg Gin Glu Thr Leu Ala Ser Phe Ser Tyr lie 
2340 2345 2350 

Trp Ser Gly Val Pro Leu Thr Arg Ala Thr Pro Ala Lys Pro Pro Val 
2355 2360 2365 

Val Arg Pro Val Gly Ser Leu Leu Val Ala Asp Thr Thr Lys Val Tyr 
2370 2375 2380 

Val Thr Asn Pro Asp Asn Val Gly Arg Arg Val Asp Lys Val Thr Phe 
2385 2390 2395 2400 

Trp Arg Ala Pro Arg Val His Asp Lys Phe Leu Val Asp Ser lie Glu 
2405 2410 2415 

Arg Ala Lys Arg Ala Ala Gin Ala Cys Leu Ser Met Gly Tyr Thr Tyr 
2420 2425 2430 

Glu Glu Ala lie Arg Thr Val Arg Pro His Ala Ala Met Gly Trp Gly 
2435 2440 2445 

Ser Lys Val Ser Val Lys Asp Leu Ala Thr Pro Ala Gly Lys Met Ala 
2450 2455 2460 

Val His Asp Arg Leu Gin Glu lie Leu Glu Gly Thr Pro Val Pro Phe 
2465 2470 2475 2480 

Thr Leu Thr Val Lys Lys Glu Val Phe Phe Lys Asp Arg Lys Glu Glu 
2485 2490 2495 

Lys Ala Pro Arg Leu lie Val Phe Pro Pro Leu Asp Phe Arg lie Ala 
2500 2505 2510 

Glu Lys Leu lie Leu Gly Asp Pro Gly Arg Val Ala Lys Ala Val Leu 
2515 2520 2525 

Gly Gly Ala Tyr Ala Phe Gin Tyr Thr Pro Asn Gin Arg He Arg Glu 
2530 2535 2540 

Met Leu Lys Leu Trp Glu Ser Lys Lys Thr Pro Cys Ala He Cys Val 
2545 2550 2555 2560 

Asp Ala Thr Cys Phe Asp Ser Ser He Thr Glu Glu Asp Val Ala Leu 
2565 2570 2575 

Glu Thr Glu Leu Tyr Ala Leu Ala Ser Asp His Pro Glu Trp Val Arg 
2580 2585 2590 

Ala Leu Gly Lys Tyr Tyr Ala Ser Gly Thr Met Val Thr Pro Glu Gly 
2595 2600 2605 
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Val Pro Val Gly Glu Arg Tyr Cys Arg Ser Ser Gly Val Leu Thr Thr 
2610 2615 2620 



Ser Ala Ser Asn Cys Leu Thr Cys Tyr lie Lys Val Lys Ala Ala Cys 
2625 2630 2635 2640 

Glu Arg Val Gly Leu Lys Asn Val Ser Leu Leu lie Ala Gly Asp Asp 
2645 2650 2655 

Cys Leu lie lie Cys Glu Arg Pro Val Cys Asp Pro Ser Asp Ala Leu 
2660 2665 2670 

Gly Arg Ala Leu Ala Ser Tyr Gly Tyr Ala Cys Glu Pro Ser Tyr His 
2675 2680 2685 

Ala Ser Leu Asp Thr Ala Pro Phe Cys Ser Thr Trp Leu Ala Glu Cys 
2690 2695 2700 

Asn Ala Asp Gly Lys Arg His Phe Phe Leu Thr Thr Asp Phe Arg Arg 
2705 2710 2715 2720 



1 Pro Leu Ala Arg Met Ser Ser Glu Tyr Ser Asp Pro Met Ala Ser Ala 
\ 2725 2730 2735 



Z He Gly Tyr He Leu Leu Tyr Pro Trp His Pro He Thr Arg Trp Val 
I 2740 2745 2750 

0 He He Pro His Val Leu Thr Cys Ala Phe Arg Gly Gly Gly Thr Pro 

2755 2760 2765 

Z Ser As P Pro Vai Tr P Cys Gin Val His Gly Asn Tyr Tyr Lys Phe Pro 

1 2770 2775 2780 

a Leu Asp Lys Leu Pro Asn He He Val Ala Leu His Gly Pro Ala Ala 
f 2785 2790 2795 2800 

Leu Arg Val Thr Ala Asp Thr Thr Lys Thr Lys Met Glu Ala Gly Lys 
2805 2810 2815 

Val Leu Ser Asp Leu Lys Leu Pro Gly Leu Ala Val His Arg Lys Lys 
2820 2825 2830 

Ala Gly Ala Leu Arg Thr Arg Met Leu Arg Ser Arg Gly Trp Ala Glu 
2835 2840 2845 

Leu Ala Arg Gly Leu Leu Trp Arg Pro Gly Leu Arg Leu Pro Pro Pro 
2850 2855 2860 

Glu He Ala Gly He Pro Gly Gly Phe Pro Leu Ser Pro Pro Tyr Met 
2865 2870 2875 2880 

Gly Val Val His Gin Leu Asp Phe Thr Ser Gin Arg Ser Arg Trp Arg 
2885 2890 2895 



Trp Leu Gly Phe Leu Ala Leu Leu He Val Ala Leu Phe Gly 
2900 2905 2910 
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